Grouping Synonyms by Definitions
نویسندگان
چکیده
We present a method for grouping the synonyms of a lemma according to its dictionary senses. The senses are defined by a large machine readable dictionary for French, the TLFi (Trésor de la langue française informatisé) and the synonyms are given by 5 synonym dictionaries (also for French). To evaluate the proposed method, we manually constructed a gold standard where for each (word, definition) pair and given the set of synonyms defined for that word by the 5 synonym dictionaries, 4 lexicographers specified the set of synonyms they judge adequate. While inter-annotator agreement ranges on that task from 67% to at best 88% depending on the annotator pair and on the synonym dictionary being considered, the automatic procedure we propose scores a precision of 67% and a recall of 71%. The proposed method is compared with related work namely, word sense disambiguation, synonym lexicon acquisition and WordNet con-
منابع مشابه
A method for grouping synonyms
Because the Princeton WordNet has proved a valuable resource in NLP, many approaches have been developed to support the automatic creation of WordNets for languages other than English. In this paper, we present a method for grouping synonyms and definitions which we believe, can provide the basis for a merge approach to WordNet creation, that is an approach which starts by defining synsets (gro...
متن کاملAcquiring meaning for French medical terminology: contribution of morphosemantics
Morphologically complex words, and particularly neoclassical compounds, form more than 60% of the neologisms in the biomedical field. Guessing their definitions and grouping them into semantic classes by means of lexical relations are thus two crucial improvements for handling these words, e.g., for information retrieval, indexing and text understanding applications. This paper describes a morp...
متن کاملExtracting Synonyms from Dictionary Definitions
Automatic extraction of synonyms and/or semantically related words has various applications in Natural Language Processing (NLP). There are currently two mainstream extraction paradigms, namely, lexicon-based and distributional approaches. The former usually suffers from low coverage, while the latter is only able to capture general relatedness rather than strict synonymy. In this paper, two ru...
متن کاملResearch on NLP for RE at Fraunhofer FKIE: A Report on Grouping Requirements
In this report we describe the previous research done by our institute in the field of requirement analysis using different natural language processing methods. To represent the different degrees of similarity between words we implemented different methods that make use of synonyms and hyperonyms. We present the strengths of our methods and identify their weaknesses. For our future research we ...
متن کاملBiosom: gene synonym analysis by self-organizing map.
There are several guidelines for gene nomenclature, but they are not always applied to the names of newly identified genes. The lack of standardization in naming genes generates inconsistent databases with errors such as genes with the same function and different names, genes with different functions and the same name, and use of an abbreviated name. This paper presents a methodology for predic...
متن کامل